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SEQUENCE LISTING 



(1) GENERAL INFORMATION: 

(i) APPLICANT: Kraus, Matthias H. 

Aaronson, Stuart A. 

(ii) TITLE OF INVENTION: AN ISOLATED POLYPEPTIDE RELATED TO THE 
EPIDERMAL GROWTH FACTOR RECEPTOR, ANTIGEN THERETO, AND 
BIOASSAYS AND METHODS RELATED THERETO 

(iii) NUMBER OF SEQUENCES: 12 

(iv) CORRESPONDENCE ADDRESS: 

(A) ADDRESSEE: Suite 400 

(B) STREET: 133 Carnegie Way, N.W. 

(C) CITY: Atlanta 

(D) STATE: Georgia 

(E) COUNTRY: U.S.A. 

(F) ZIP: 30303 

(v) COMPUTER READABLE FORM: 

(A) MEDIUM TYPE: Floppy disk 

(B) COMPUTER: IBM PC compatible 

(C) OPERATING SYSTEM: PC-DOS/MS-DOS 

(D) SOFTWARE: Patentln Release #1.0, Version #1.25 

(vi) CURRENT APPLICATION DATA: 

(A) APPLICATION NUMBER: 

(B) FILING DATE: 

(C) CLASSIFICATION: 

(vii) PRIOR APPLICATION DATA: 

(A) APPLICATION NUMBER: US 07/444,406 

(B) FILING DATE: 01-DEC-1989 

(viii) ATTORNEY/AGENT INFORMATION: 

(A) NAME: Perryman, David G. 

(B) REGISTRATION NUMBER: 33,438 

(C) REFERENCE/DOCKET NUMBER: 1414-028 

(ix) TELECOMMUNICATION INFORMATION: 

(A) TELEPHONE: (404) 688-0770 

(B) TELEFAX: (404) 688-9880 

(2) INFORMATION FOR SEQ ID N0:1: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1542 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
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(ix) FEATURE: 

(A) NAME/KEY: exon 

(B) LOCATION: 66.. 221 

(ix) FEATURE: 

(A) NAME/KEY: exon 

(B) LOCATION: 780.. 855 

(ix) FEATURE: 

(A) NAME/KEY: exon 

(B) LOCATION: 1040.. 1185 

(ix) FEATURE: 

(A) NAME/KEY: intron 

(B) LOCATION: 222.-779 '. 

(ix) FEATURE: 

(A) NAME/ KEY: intron 

(B) LOCATION: 856.. 1039 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: join(66. .221, 780.. 855, 1040.. 1185) 
(xi) SEQUENCE DESCRIPTION: SEQ ID N0:1: 

GAATTCCAGA TCTCAGTGAC TGATTCCCCC AACCTTAAGA ATACTTTCTT CCCCTATACC 60 

TACAG GGA ATG TAC TAC CTT GAG GAA CAT GGT ATG GTG CAT AGA AAC 107 
Gly Met Tyr Tyr Leu Glu Glu His Gly Met Val His Arg Asn 
1 5 10 

CTG GCT GCC CGA AAC GTG CTA CTC AAG TCA CCC AGT CAG GTT CAG GTG 155 
Leu Ala Ala Arg Asn Val Leu Leu Lys Ser Pro Ser Gin Val Gin Val 
15 ~ 20 25 30 

GCA GAT TTT GGT GTG GCT GAC CTG CTG CCT CCT GAT GAT AAG CAG CTG 203 
Ala Asp Phe Gly Val A>a Asp Leu Leu Pro Pro Asp Asp Lys Gin Leu 
35 40 45 

CTA TAC AGT GAG GCC AAG GTGAGGAGAC ACAAAGGGTA AGGAGGCGGG 251 
Leu Tyr Ser Glu Ala Lys 
50 

GGTGGAGTGA AGCATGGGGA TAGGGAGCAG CCAGTGGTCT CTTCCAGAGG CAAGCAGATG 311 

CTTCATGGTA AGTTCAAGGA GAGAAGGCTG CAGATGCCAG ATATTTTAGT TCAGAGGGCA 371 

ACAAAGAAAA TAATGATCAA GAACTTGGGA CTGGCCGGGC GCGGTGGCTC ACGCCTGTAA 431 

TCCCAACACT TCGGGAGGCC AAGGCGGGTG GATCACAAGG TCAGGAGATC AAGACCATCC 491 

TGGCTAGCAC GGTGAAACCC CGTCTCTACT AAATATACAA AAAAAAAAAA ATTAGCCAGG 551 

CGTGGCGGCA TGCATCTGTA CTCCCAGCTA CTCGGGAGGC TGAGGCAGGA GAATGGCGTG 611 
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AACCCAGGAG GCGGAGCTTG CAGTGGGCCG AGATCGCACC ACTGCACTCC AGTCTGGGCG 671 

ACAGAGCGAG ACTCCGTCTC AAAAAAAAAA AAAAAAGAAT TTGGGACTTG GAAATCCTAA 731 

GAAAATTTGT GGAAATAAAC TTGTGATACC TCTATCTTTA ATCCGCAG ACT CCA ATT 788 

Thr Pro He 
55 

AAG TGG ATG GCC CTT GAG AGT ATC CAC TTT GGG AAA TAC ACA CAC CAG 836 
Lys Trp Met Ala Leu Glu Ser He His Phe Gly Lys Tyr Thr His Gin 
60 65 70 

AGT GAT GTC TGG AGC TAT G GTCAGTGCAT CTGGATGCCC TCTCTACCAT 885 
Ser Asp Val Trp Ser Tyr 

75 ' 

CACTGGCCCC AGTTTCAAAT TTACCTTTTG AGAGCCCCCT CTTAGAATCT CTAAGCACTT 945 

CAGATTTTTG TGTTAGATCA GGTTCTGCCT TCCCTTCACT TCATGCCCAT GTCTACTATT 1005 

TTGCCAGTGA CTAGTCCATG TCTTCCTGCA ACAG GT GTG ACA GTT TGG GAG 1056 

Gly Val Thr Val Trp Glu 
80 

TTG ATG ACC TTC GGG GCA GAG CCC TAT GCA GGG CTA CGA TTG GCT GAA 1104 
Leu Met Thr Phe Gly Ala Glu Pro Tyr Ala Gly Leu Arg Leu Ala Glu 
85 90 95 

GTA CCA GAC CTG CTA GAG AAG GGG GAG CGG TTG GCA CAG CCC CAG ATC 1152 
Val Pro Asp Leu Leu Glu Lys Gly Glu Arg Leu Ala Gin Pro Gin He 
100 105 ' 110 115 

TGC ACA ATT GAT GTC TAC ATG GTG ATG GTC AAG TGTGAGTTAC CTGCTGAGCC 1205 
Cys Thr He Asp Val Tyr Met Val Met Val Lys 
120 125 

CAACCATTTT CTCTTTTTTT CTTTTTTTTT CTTTTTTTTT TTTTTTTGAG ACAGAGTCTC 1265 

ACAATTGTCA CCCAGGCTGG AGTGCAATGG TGCAATCAAT CTTGGCTCAC TACAACCTCC 1325 

GCCTCTCGGG TTCAAGAGAT TCTCCTGCTT CAGCTCCGGA GTAGCTGGGA TTACAGCGCC 1385 

CGCCACACCT GGATAACTGT TACACTTTTA GTAGAGATGG GGTTTCACCA TGTTGGCCAG 1445 

GCTGGTCTCA AACTCCTGAC CTCAGGTGAT CCGCCTGCCT CAGCTTCCCA AAGTGCTGGG 1505 

ATTACAGGTG TGAGCCATCA TGCTCGCCTG ACTGCAG 1542 

(2) INFORMATION FOR SEQ ID N0:2: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 126 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 
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(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:2: 

Gly Met Tyr Tyr Leu Glu Glu His Gly Met Val His Arg Asn 
15 10 

Ala Arg Asn Val Leu Leu Lys Ser Pro Ser Gin Val Gin Val 
20 25 30 

Phe Gly Val Ala Asp Leu Leu Pro Pro Asp Asp Lys Gin Leu 
35 40 45 

Ser Glu Ala Lys Thr Pro He Lys Trp Met Ala Leu Glu Ser 
50 55 60 

Phe Gly Lys Tyr Thr His Gin Ser Asp Val Trp Ser Tyr Gly 
65 70 75 

Val Trp Glu Leu Met Thr Phe Gly Ala Glu Pro Tyr Ala Gly 
85 90 

Leu Ala Glu Val Pro Asp Leu Leu Glu Lys Gly Glu Arg Leu 
100 105 110 

Pro Gin He Cys Thr He Asp Val Tyr Met Val Met Val Lys 
115 120 125 

(2) INFORMATION FOR SEQ ID N0:3: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 4905 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: cDNA 

(ix) FEATURE: 

(A) NAME/KEY: CDS 

(B) LOCATION: 100.. 4125 



(xi) SEQUENCE. DESCRIPTION: SEQ ID N0:3: 

ACCAATTCGC CAGCGGTTCA GGTGGCTCTT GCCTCGATGT CCTAGCCTAG GGGCCCCCGG 60 

GCCGGACTTG GCTGGGCTCC CTTCACCCTC TGCGGAGTC ATG AGG GCG AAC GAC 114 

Met Arg Ala Asn Asp 
1 5 

GCT CTG CAG GTG CTG GGC TTG CTT TTC AGC CTG GCC CGG GGC TCC GAG 162 
Ala Leu Gin Val Leu Gly Leu Leu Phe Ser Leu Ala Arg Gly Ser Glu 
10 15 20 



Leu Ala 
15 

Ala Asp 
Leu Tyr 
He His 



Val Thr 
80 

Leu Arg 
95 

Ala Gin 



i 



52 

GTG GGC AAC TCT CAG GCA GTG TGT CCT GGG ACT CTG AAT GGC CTG AGT 210 

Val Gly Asn Ser Gin Ala Val Cys Pro Gly Thr Leu Asn Gly Leu Ser 
25 30 35 

GTG ACC GGC GAT GCT GAG AAC CAA TAG CAG ACA CTG TAC AAG CTC TAC 258 
Val Thr Gly Asp Ala Glu Asn Gin Tyr Gin Thr Leu Tyr Lys Leu Tyr 
40 45 50 

GAG AGG TGT GAG GTG GTG ATG GGG AAC CTT GAG ATT GTG CTC ACG GGA 306 
Glu Arg Cys Glu Val Val Met Gly Asn Leu Glu lie Val Leu Thr Gly 
55 60 65 

CAC AAT GCC GAC CTC TCC TTC CTG CAG TGG ATT CGA GAA GTG ACA GGC 354 
His Asn Ala Asp Leu Ser Phe Leu Gin Trp He Arg Glu Val Thr Gly 
70 75 80 85 > 

TAT GTC CTC GTG GCC ATG AAT GAA TTC TCT ACT CTA CCA TTG CCC AAC 402 
Tyr Val Leu Val Ala Met Asn Glu Phe Ser Thr Leu Pro Leu Pro Asn 
90 95 100 

CTC CGC GTG GTG CGA GGG ACC CAG GTC TAC GAT GGG AAG TTT GCC ATC 450 
Leu Arg Val Val Arg Gly Thr Gin Val Tyr Asp Gly Lys Phe Ala He 
105 " 110 115 

TTC GTC ATG TTG AAC TAT AAC ACC AAC TCC AGC CAC GCT CTG CGC CAG 498 
Phe Val Met Leu Asn Tyr Asn Thr Asn Ser Ser His Ala Leu Arg Gin 
120 125 130 

CTC CGC TTG ACT CAG CTC ACC GAG ATT CTG TCA GGG GGT GTT TAT ATT 546 
Leu Arg Leu Thr Gin Leu Thr Glu He Leu Ser Gly Gly Val Tyr lie 
135 140 145 

GAG AAG AAC GAT AAG CTT TGT CAC ATG GAC ACA ATT GAC TGG AGG GAC 594 
Glu Lys Asn Asp Lys Leu Cys His Met Asp Thr He Asp Trp Arg Asp 
150 155 160 165 

ATC GTG AGG GAC CGA GAT GCT GAG ATA GTG GTG AAG GAC AAT GGC AGA 642 
He Val Arg Asp Arg Asp Ala Glu He Val Val Lys Asp Asn Gly Arg 
170 175 180 

AGC TGT CCC CCC TGT CAT GAG GTT TGC AAG GGG CGA TGC TGG GGT CCT 690 
Ser Cys Pro Pro Cys His Glu Val Cys Lys Gly Arg Cys Trp Gly Pro 
185 190 195 

GGA TCA GAA GAC TGC CAG ACA TTG ACC AAG ACC ATC TGT GCT CCT CAG 738 
Gly Ser Glu Asp Cys Gin Thr Leu Thr Lys Thr He Cys Ala Pro Gin 
200 205 210 

TGT AAT GGT CAC TGC TTT GGG CCC AAC CCC AAC CAG TGC TGC CAT GAT 786 
Cys Asn Gly His Cys Phe Gly Pro Asn Pro Asn Gin Cys Cys His Asp 
215 220 225 

GAG TGT GCC GGG GGC TGC TCA GGC CCT CAG GAC ACA GAC TGC TTT GCC 834 
Glu Cys Ala Gly Gly Cys Ser Gly Pro Gin Asp Thr Asp Cys Phe Ala 
230 235 240 245 



I 
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TGC CGG CAC TTC AAT GAC AGT GGA GCC TGT GTA CCT CGC TGT CCA CAG 882 
Cys Arg His Phe Asn Asp Ser Gly Ala Cys Val Pro Arg Cys Pro Gin 
250 255 260 

CCT CTT GTC TAC AAC AAG CTA ACT TTC CAG CTG GAA CCC AAT CCC CAC 930 
Pro Leu Val Tyr Asn Lys Leu Thr Phe Gin Leu Glu Pro Asn Pro His 
265 ~ 270 275 

ACC AAG TAT CAG TAT GGA GGA GTT TGT GTA GCC AGC TGT CCC CAT AAC 978 
Thr Lys Tyr Gin Tyr Gly Gly Val Cys Val Ala Ser Cys Pro His Asn 
280 285 290 

TTT GTG GTG GAT CAA ACA TCC TGT GTC AGG GCC TGT CCT CCT GAC AAG 1026 
Phe Val Val Asp Gin Thr Ser Cys Val Arg Ala Cys Pro Pro Asp Lys 

295 300 305 i 

ATG GAA GTA GAT AAA AAT GGG CTC AAG ATG TGT GAG CCT TGT GGG GGA 1074 
Met Glu Val Asp Lys Asn Gly Leu Lys Met Cys Glu Pro Cys Gly Gly 
310 315 320 325 

CTA TGT CCC AAA GCC TGT GAG GGA ACA GGC TCT GGG AGC CGC TTC CAG 1122 
Leu Cys Pro Lys Ala Cys Glu Gly Thr Gly Ser Gly Ser Arg Phe Gin 
330 335 340 

ACT GTG GAC TCG AGC AAC ATT GAT GGA TTT GTG AAC TGC ACC AAG ATC 1170 
Thr Val Asp Ser Ser Asn He Asp Gly Phe Val Asn Cys Thr Lys He 
345 350 355 

CTG GGC AAC CTG GAC TTT CTG ATC ACC GGC CTC AAT GGA GAC CCC TGG 1218 
Leu Gly Asn Leu Asp Phe Leu He Thr Gly Leu Asn Gly Asp Pro Trp 
360 365 370 

CAC AAG ATC CCT GCC CTG GAC CCA GAG AAG CTC AAT GTC TTC CGG ACA 1266 
His Lys He Pro Ala Leu Asp Pro Glu Lys Leu Asn Val Phe Arg Thr 
375 380 385 

GTA CGG GAG ATC ACA GGT TAC CTG AAC ATC CAG TCC TGG CCG CCC CAC 1314 
Val Arg Glu He Thr Gly Tyr Leu Asn He Gin Ser Trp Pro Pro His 
390 395 400 405 

ATG CAC AAC TTC AGT GTT TTT TCC AAT TTG ACA ACC ATT GGA GGC AGA 1362 
Met His Asn Phe Ser Val Phe Ser Asn Leu Thr Thr He Gly Gly Arg 
410 415 420 

AGC CTC TAC AAC CGG GGC TTC TCA TTG TTG ATC ATG AAG AAC TTG AAT 1410 
Ser Leu Tyr Asn Arg Gly Phe Ser Leu Leu He Met Lys Asn Leu Asn 
425 430 435 

GTC ACA TCT CTG GGC TTC CGA TCC CTG AAG GAA ATT AGT GCT GGG CGT 1458 
Val Thr Ser Leu Gly Phe Arg Ser Leu Lys Glu He Ser Ala Gly Arg 
440 445 450 



ATC TAT ATA AGT GCC AAT AGG CAG CTC TGC TAC CAC CAC TCT TTG AAC 1506 
lie Tyr He Ser Ala Asn Arg Gin Leu Cys Tyr His His Ser Leu Asn 
455 460 465 
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TGG ACC AAG GTG CTT CGG GGG CCT ACG GAA GAG CGA CTA GAC ATC AAG 1554 
Trp Thr Lys Val Leu Arg Gly Pro Thr Glu Glu Arg Leu Asp lie Lys 
470 475 480 485 

CAT AAT CGG CCG CGC AGA GAC TGC GTG GCA GAG GGC AAA GTG TGT GAC 1602 
His Asn Arg Pro Arg Arg Asp Cys Val Ala Glu Gly Lys Val Cys Asp 
490 495 500 

CCA CTG TGC TCC TCT GGG GGA TGC TGG GGC CCA GGC CCT GGT CAG TGC 1650 
Pro Leu Cys Ser Ser Gly Gly Cys Trp Gly Pro Gly Pro Gly Gin Cys 
505 510 515 

TTG TCC TGT CGA AAT TAT AGC CGA GGA GGT GTC TGT GTG ACC CAC TGC 1698 
Leu Ser Cys Arg Asn Tyr Ser Arg Gly Gly Val Cys Val Thr His Cys 
520 525 530 

AAC TTT CTG AAT GGG GAG CCT CGA GAA TTT GCC CAT GAG GCC GAA TGC 1746 
Asn Phe Leu Asn Gly Glu Pro Arg Glu Phe Ala His Glu Ala Glu Cys 
535 540 545 

TTC TCC TGC CAC CCG GAA TGC CAA CCC ATG GAG GGC ACT GCC ACA TGC 1794 
Phe Ser Cys His Pro Glu Cys Gin Pro Met Glu Gly Thr Ala Thr Cys 
550 555 560 565 

AAT GGC TCG GGC TCT GAT ACT TGT GCT CAA TGT GCC CAT TTT CGA GAT 1842 
Asn Gly Ser Gly Ser Asp Thr Cys Ala Gin Cys Ala His Phe Arg Asp 
570 575 580 

GGG CCC CAC TGT GTG AGC AGC TGC CCC CAT GGA GTC CTA GGT GCC AAG 1890 
Gly Pro His Cys Val Ser Ser Cys Pro His Gly Val Leu Gly Ala Lys 
585 590 595 

GGC CCA ATC TAC AAG TAC CCA GAT GTT CAG AAT GAA TGT CGG CCC TGC 1938 
Gly Pro He Tyr Lys Tyr Pro Asp Val Gin Asn Glu Cys Arg Pro Cys 
600 605 610 

CAT GAG AAC TGC ACC CAG GGG TGT AAA GGA CCA GAG CTT CAA GAC TGT 1986 
His Glu Asn Cys Thr Gin Gly Cys Lys Gly Pro Glu Leu Gin Asp Cys 
615 620 625 

TTA GGA CAA ACA CTG GTG CTG ATC GGC AAA ACC CAT CTG ACA ATG GCT 2034 
Leu Gly Gin Thr Leu Val Leu He Gly Lys Thr His Leu Thr Met Ala 
630 635 640 645 

TTG ACA GTG ATA GCA GGA TTG GTA GTG ATT TTC ATG ATG CTG GGC GGC 2082 
Leu Thr Val He Ala Gly Leu Val Val He Phe Met Met Leu Gly Gly 
650 655 660 

ACT TTT CTC TAC TGG CGT GGG CGC CGG ATT CAG AAT AAA AGG GCT ATG 2130 
Thr Phe Leu Tyr Trp Arg Gly Arg Arg He Gin Asn Lys Arg Ala Met 
665 670 675 

AGG CGA TAC TTG GAA CGG GGT GAG AGC ATA GAG CCT CTG GAC CCC AGT 2178 
Arg Arg Tyr Leu Glu Arg Gly Glu Ser He Glu Pro Leu Asp Pro Ser 
680 685 690 



1 



55 



GAG AAG GCT AAC AAA GTC TTG GCC AGA ATC TTC AAA GAG ACA GAG CTA 
Glu Lys Ala Asn Lys Val Leu Ala Arg He Phe Lys Glu Thr Glu Leu 
695 700 705 



2226 



AGG AAG CTT AAA GTG CTT GGC TCG GGT GTC TTT GGA ACT GTG CAC AAA 
Arg Lys Leu Lys Val Leu Gly Ser Gly Val Phe Gly Thr Val His Lys 
710 715 720 725 



2274 



GGA GTG TGG ATC CCT GAG GGT GAA TCA ATC AAG ATT CCA GTC TGC ATT 
Gly Val Trp He Pro Glu Gly Glu Ser He Lys He Pro Val Cys He 
730 735 740 



2322 



AAA GTC ATT GAG GAC AAG AGT GGA CGG CAG AGT TTT CAA GCT GTG ACA 
Lys Val He Glu Asp Lys Ser Gly Arg Gin Ser Phe Gin Ala Val Thr 
745 750 755 



2370 



GAT CAT ATG CTG GCC ATT GGC AGC CTG GAC CAT GCC CAC ATT GTA AGG 
Asp His Met Leu Ala He Gly Ser Leu Asp His Ala His He Val Arg 
760 765 770 



2418 



CTG CTG GGA CTA TGC CCA GGG TCA TCT CTG CAG CTT GTC ACT CAA TAT 
Leu Leu Gly Leu Cys Pro Gly Ser Ser Leu Gin Leu Val Thr Gin Tyr 
775 780 785 



2466 



TTG CCT CTG GGT TCT CTG CTG GAT CAT GTG AGA CAA CAC CGG GGG GCA 
Leu Pro Leu Gly Ser Leu Leu Asp His Val Arg Gin His Arg Gly Ala 
790 795 800 805 



2514 



CTG GGG CCA CAG CTG CTG CTC AAC TGG GGA GTA CAA ATT GCC AAG GGA 
Leu Gly Pro Gin Leu Leu Leu Asn Trp Gly Val Gin He Ala Lys Gly 
810 815 820 



2562 



ATG TAC TAC CTT GAG GAA CAT GGT ATG GTG CAT AGA AAC CTG GCT GCC 2610 
Met Tyr Tyr Leu Glu Glu His Gly Met Val His Arg Asn Leu Ala Ala 
825 830 835 

CGA AAC GTG CTA CTC AAG TCA CCC AGT CAG GTT CAG GTG GCA GAT TTT 2658 
Arg Asn Val Leu Leu Lys Ser Pro Ser Gin Val Gin Val Ala Asp Phe 
840 845 850 

GGT GTG GCT GAC CTG CTG CCT CCT GAT GAT AAG CAG CTG CTA TAC AGT 2706 
Gly Val Ala Asp Leu Leu Pro Pro Asp Asp Lys Gin Leu Leu Tyr Ser 
855 860 865 

GAG GCC AAG ACT CCA ATT AAG TGG ATG GCC CTT GAG AGT ATC CAC TTT 2754 
Glu Ala Lys Thr Pro He Lys Trp Met Ala Leu Glu Ser He His Phe 
870 875 880 885 



GGG AAA TAC ACA CAC CAG AGT GAT GTC TGG AGC TAT GGT GTG ACA GTT 
Gly Lys Tyr Thr His Gin Ser Asp Val Trp Ser Tyr Gly Val Thr Val 
890 895 900 



2802 



TGG GAG TTG ATG ACC TTC GGG GCA GAG CCC TAT GCA GGG CTA CGA TTG 
Trp Glu Leu Met Thr Phe Gly Ala Glu Pro Tyr Ala Gly Leu Arg Leu 
905 910 915 



2850 
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GCT GAA GTA CCA GAC CTG CTA GAG AAG GGG GAG CGG TTG GCA CAG CCC 2898 
Ala Glu Val Pro Asp Leu Leu Glu Lys Gly GTu Arg Leu Ala Gin Pro 
920 925 930 

CAG ATC TGC ACA ATT GAT GTC TAC ATG GTG ATG GTC AAG TGT TGG ATG 2946 
Gin He Cys Thr He Asp Val Tyr Met Val Met Val Lys Cys Trp Met 
935 940 945 

ATT GAT GAG AAC ATT CGC CCA ACC TTT AAA GAA CTA GCC AAT GAG TTC 2994 
He Asp Glu Asn He Arg Pro Thr Phe Lys Glu Leu Ala Asn Glu Phe 
950 955 960 965 

ACC AGG ATG GCC CGA GAC CCA CCA CGG TAT CTG GTC ATA AAG AGA GAG 3042 
Thr Arg Met Ala Arg Asp Pro Pro Arg Tyr Leu Val He Lys Arg Glu 

970 975 980 ! 

AGT GGG CCT GGA ATA GCC CCT GGG CCA GAG CCC CAT GGT CTG ACA AAC 3090 
Ser Gly Pro Gly He Ala Pro Gly Pro Glu Pro His Gly Leu Thr Asn 
985 990 995 

AAG AAG CTA GAG GAA GTA GAG CTG GAG CCA GAA CTA GAC CTA GAC CTA 3138 
Lys Lys Leu Glu Glu Val Glu Leu Glu Pro Glu Leu Asp Leu Asp Leu 
1000 1005 1010 

GAC TTG GAA GCA GAG GAG GAC AAC CTG GCA ACC ACC ACA CTG GGC TCC 3186 
Asp Leu Glu Ala Glu Glu Asp Asn Leu Ala Thr Thr Thr Leu Gly Ser 
1015 1020 1025 

GCC CTC AGC CTA CCA GTT GGA ACA CTT AAT CGG CCA CGT GGG AGC CAG 3234 
Ala Leu Ser Leu Pro Val Gly Thr Leu Asn Arg Pro Arg Gly Ser Gin 
1030 1035 1040 1045 

AGC CTT TTA AGT CCA TCA TCT GGA TAC ATG CCC ATG AAC CAG GGT AAT 3282 
Ser Leu Leu Ser Pro Ser Ser Gly Tyr Met Pro Met Asn Gin Gly Asn 
1050 1055 1060 

CTT GGG GAG TCT TGC CAG GAG TCT GCA GTT TCT GGG AGC AGT GAA CGG 3330 
Leu Gly Glu Ser Cys Gin Glu Ser Ala Val Ser Gly Ser Ser Glu Arg 
1065 1070 1075 

TGC CCC CGT CCA GTC TCT CTA CAC CCA ATG CCA CGG GGA TGC CTG GCA 3378 
Cys Pro Arg Pro Val Ser Leu His Pro Met Pro Arg Gly Cys Leu Ala 
1080 1085 1090 

TCA GAG TCA TCA GAG GGG CAT GTA ACA GGC TCT GAG GCT GAG CTC CAG 3426 
Ser Glu Ser Ser Glu Gly His Val Thr Gly Ser Glu Ala Glu Leu Gin 
1095 1100 1105 

GAG AAA GTG TCA ATG TGT AGA AGC CGG AGC AGG AGC CGG AGC CCA CGG 3474 
Glu Lys Val Ser Met Cys Arg Ser Arg Ser Arg Ser Arg Ser Pro Arg 
1110 1115 " 1120 1125 

CCA CGC GGA GAT AGC GCC TAC CAT TCC CAG CGC CAC AGT CTG CTG ACT 3522 
Pro Arg Gly Asp Ser Ala Tyr His Ser Gin Arg His Ser Leu Leu Thr 
1130 1135 1140 



I 
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CCT GTT ACC CCA CTC TCC CCA CCC 6GG TTA GAG GAA GAG GAT GTC AAC 3570 
Pro Val Thr Pro Leu Ser Pro Pro Gly Leu Glu Glu Glu Asp Val Asn 
1145 1150 1155 

GGT TAT GTC ATG CCA GAT ACA CAC CTC AAA GGT ACT CCC TCC TCC CGG 3618 
Gly Tyr Val Met Pro Asp Thr His Leu Lys Gly Thr Pro Ser Ser Arg 
1160 1165 1170 

GAA GGC ACC CTT TCT TCA GTG GGT CTT AGT TCT GTC CTG GGT ACT GAA 3666 
Glu Gly Thr Leu Ser Ser Val Gly Leu Ser Ser Val Leu Gly Thr Glu 
1175 1180 1185 

GAA GAA GAT GAA GAT GAG GAG TAT GAA TAC ATG AAC CGG AGG AGA AGG 3714 

Glu Glu Asp Glu Asp Glu Glu Tyr Glu Tyr Met Asn Arg Arg Arg Arg 

1190 1195 1200 " ' 1205 J 

CAC AGT CCA CCT CAT CCC CCT AGG CCA AGT TCC CTT GAG GAG CTG GGT 3762 
His Ser Pro Pro His. Pro Pro Arg Pro Ser Ser Leu Glu Glu Leu Gly 
1210 1215 1220 

TAT GAG TAC ATG GAT GTG GGG TCA GAC CTC AGT GCC TCT CTG GGC AGC 3810 
Tyr Glu Tyr Met Asp Val Gly Ser Asp Leu Ser Ala Ser Leu Gly Ser 
1225 1230 1235 

ACA CAG AGT TGC CCA CTC CAC CCT GTA CCC ATC ATG CCC ACT GCA GGC 3858 
Thr Gin Ser Cys Pro Leu His Pro Val Pro He Met Pro Thr Ala Gly 
1240 1245 1250 

ACA ACT CCA GAT GAA GAC TAT GAA TAT ATG AAT CGG CAA CGA GAT GGA 3906 
Thr Thr Pro Asp Glu Asp Tyr Glu Tyr Met Asn Arg Gin Arg Asp Gly 
1255 1260 * 1265 

GGT GGT CCT GGG GGT GAT TAT GCA GCC ATG GGG GCC TGC CCA GCA TCT 3954 
Gly Gly Pro Gly Gly Asp Tyr Ala Ala Met Gly Ala Cys Pro Ala Ser 
1270 1275 1280 1285 

GAG CAA GGG TAT GAA GAG ATG AGA GCT TTT CAG GGG CCT GGA CAT CAG 4002 
Glu Gin Gly Tyr Glu Glu Met Arg Ala Phe Gin Gly Pro Gly His Gin 
1290 1295 1300 

GCC CCC CAT GTC CAT TAT GCC CGC CTA AAA ACT CTA CGT AGC TTA GAG 4050 
Ala Pro His Val His Tyr Ala Arg Leu Lys Thr Leu Arg Ser Leu Glu 
1305 1310 1315 

GCT ACA GAC TCT GCC TTT GAT AAC CCT GAT TAC TGG CAT AGC AGG CTT 4098 
Ala Thr Asp Ser Ala Phe Asp Asn Pro Asp Tyr Trp His Ser Arg Leu 
1320 1325 1330 

TTC CCC AAG GCT AAT GCC CAG AGA ACG TAACTCCTGC TCCCTGTGGC 4145 
Phe Pro Lys Ala Asn Ala Gin Arg Thr 
1335 1340 

ACTCAGGGAG CATTTAATGG CAGCTAGTGC CTTTAGAGGG TACCGTCTTC TCCCTATTCC 4205 

CTCTCTCTCC CAGGTCCCAG CCCCTTTTCC CCAGTCCCAG ACAATTCCAT TCAATCTTTG 4265 



t 
t 
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GAGGCI 1 1 IA 


AACATTTTGA 


CACAAAATTC 


TTATGGTATG 


TAGCCAGCTG TGCACTTTCT 


4325 


TCTCTTTCCC 


AACCCCAGGA 


AAGGTTTTCC 


TTAI 1 1 IGTG 


TGCTTTCCCA GTCCCATTCC 


4385 


TCAGCTTCTT 


CACAGGCACT 


CCTGGAGATA 


TGAAGGATTA 


CTCTCCATAT CCCTTCCTCT 


4445 


CAGGCTCTTG 


ACTACTTGGA 


ACTAGGCTCT 


TATGTGTGCC 


TTTGTTTCCC ATCAGACTGT 


4505 


CAAGAAGAGG 


AAAGGGAGGA 


AACCTAGCAG 


AGGAAAGTGT 


AATTTTGGTT TATGACTCTT 


4565 


AACCCCCTAG 


AAAGACAGAA 


GCTTAAAATC 


TGTGAAGAAA 


GAGGTTAGGA GTAGATATTG 


4625 


ATTACTATCA 


TAATTCAGCA 


CTTAACTATG 


AGCCAGGCAT 


CATACTAAAC TTCACCTACA 


4685 


TTATCTCACT 


TAGTCCTTTA 


TCATCCTTAA 


AACAATTCTG 


TGACATACAT ATTATCTCAT 


4745 


TTTACACAAA 


GGGAAGTCGG 


GCATGGTGGC 


TCATGCCTGT 


AATCTCAGCA CTTTGGGAGG 


4805 


CTGAGGCAGA 


AGGATTACCT 


GAGGCAAGGA 


GTTTGAGACC 


AGCTTAGCCA ACATAGTAAG 


4865 


ACCCCCATCT 


CTTTAAAAAA 


AAAAAAAAAA 


AAAAAAAAAA 




4905 



(2) INFORMATION FOR SEQ ID N0:4: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 1342 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: protein 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:4: 

Met Arg Ala Asn Asp Ala Leu Gin Val Leu Gly Leu Leu Phe Ser Leu 
15 10 15 

Ala Arg Gly Ser Glu Val Gly Asn Ser Gin Ala Val Cys Pro Gly Thr 
20 25 30 

Leu Asn Gly Leu Ser Val Thr Gly Asp Ala Glu Asn Gin Tyr Gin Thr 
35 40 45 

Leu Tyr Lys Leu Tyr Glu Arg Cys Glu Val Val Met Gly Asn Leu Glu 
50 55 60 

He Val Leu Thr Gly His Asn Ala Asp Leu Ser Phe Leu Gin Trp He 
65 70 75 80 

Arg Glu Val Thr Gly Tyr Val Leu Val Ala Met Asn Glu Phe Ser Thr 
85 90 95 

Leu Pro Leu Pro Asn Leu Arg Val Val Arg Gly Thr Gin Val Tyr Asp 
100 105 110 



59 



Gly Lys Phe Ala He Phe Val Met Leu Asn Tyr Asn Thr Asn Ser Ser 
115 120 125 

His Ala Leu Arg Gin Leu Arg Leu Thr Gin Leu Thr Glu He Leu Ser 
130 135 140 

Gly Gly Val Tyr He Glu Lys Asn Asp Lys Leu Cys His Met Asp Thr 
145 " 150 155 160 

lie Asp Trp Arg Asp He Val Arg Asp Arg Asp Ala Glu He Val Val 
165 170 175 

Lys Asp Asn Gly Arg Ser Cys Pro Pro Cys His Glu Val Cys Lys Gly 
180 ' 185 190 

Arg Cys Trp Gly Pro Gly Ser Glu Asp Cys Gin Thr Leu Thr Lys Thr 
195 200 205 

He Cys Ala Pro Gin Cys Asn Gly His Cys Phe Gly Pro Asn Pro Asn 
210 215 220 

Gin Cys Cys His Asp Glu Cys Ala Gly Gly Cys Ser Gly Pro Gin Asp 
225 230 235 240 

Thr Asp Cys Phe Ala Cys Arg His Phe Asn Asp Ser Gly Ala Cys Val 
245 " 250 255 

Pro Arg Cys Pro Gin Pro Leu Val Tyr Asn Lys Leu Thr Phe Gin Leu 
260 265 270 

Glu Pro Asn Pro His Thr Lys Tyr Gin Tyr Gly Gly Val Cys Val Ala 
275 280 285 

Ser Cys Pro His Asn Phe Val Val Asp Gin Thr Ser Cys Val Arg Ala 
290 295 300 

Cys Pro Pro Asp Lys Met Glu Val Asp Lys Asn Gly Leu Lys Met Cys 
305 310 315 320 

Glu Pro Cys Gly Gly Leu Cys Pro Lys Ala Cys Glu Gly Thr Gly Ser 
325 330 335 

Gly Ser Arg Phe Gin Thr Val Asp Ser Ser Asn He Asp Gly Phe Val 
340 . 345 350 

Asn Cys Thr Lys He Leu Gly Asn Leu Asp Phe Leu He Thr Gly Leu 
355 360 365 

Asn Gly Asp Pro Trp His Lys He Pro Ala Leu Asp Pro Glu Lys Leu 
370 375 380 

Asn Val Phe Arg Thr Val Arg Glu He Thr Gly Tyr Leu Asn He Gin 
385 390 395 400 
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Ser Trp Pro Pro His Met His Asn Phe Ser Val Phe Ser Asn Leu Thr 
405 410 415 

Thr lie Gly Gly Arg Ser Leu Tyr Asn Arg Gly Phe Ser Leu Leu lie 
420 425 430 

Met Lys Asn Leu Asn Val Thr Ser Leu Gly Phe Arg Ser Leu Lys Glu 
435 440 445 

lie Ser Ala Gly Arg lie Tyr He Ser Ala Asn Arg Gin Leu Cys Tyr 
450 455 460 

His His Ser Leu Asn Trp Thr Lys Val Leu Arg Gly Pro Thr Glu Glu 
465 470 475 480 

Arg Leu Asp He Lys His Asn Arg Pro Arg Arg Asp Cys Val Ala Glu 
485 " 490 495 

Gly Lys Val Cys Asp Pro Leu Cys Ser Ser Gly Gly Cys Trp Gly Pro 
500 505 510 

Gly Pro Gly Gin Cys Leu Ser Cys Arg Asn Tyr Ser Arg Gly Gly Val 
515 520 525 

Cys Val Thr His Cys Asn Phe Leu Asn Gly Glu Pro Arg Glu Phe Ala 
• 530 535 540 

His Glu Ala Glu Cys Phe Ser Cys His Pro Glu Cys Gin Pro Met Glu 
545 550 555 560 

Gly Thr Ala Thr Cys Asn Gly Ser Gly Ser Asp Thr Cys Ala Gin Cys 
565 570 575 

Ala His Phe Arg Asp Gly Pro His Cys Val Ser Ser Cys Pro His Gly 
580 585 590 

Val Leu Gly Ala Lys Gly Pro He Tyr Lys Tyr Pro Asp Val Gin Asn 
595 600 605 

Glu Cys Arg Pro Cys His Glu Asn Cys Thr Gin Gly Cys Lys Gly Pro 
610 615 620 

Glu Leu Gin Asp Cys Leu Gly Gin Thr Leu Val Leu He Gly Lys Thr 
625 . 630 635 640 

His Leu Thr Met Ala Leu Thr Val He Ala Gly Leu Val Val He Phe 
645 650 655 

Met Met Leu Gly Gly Thr Phe Leu Tyr Trp Arg Gly Arg Arg He Gin 
660 665 670 



Asn Lys Arg Ala Met Arg Arg Tyr Leu Glu Arg Gly Glu Ser lie Glu 
675 680 685 
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Pro Leu Asp Pro Ser Glu Lys Ala Asn Lys Val Leu Ala Arg He Phe 
690 695 700 

Lys Glu Thr Glu Leu Arg Lys Leu Lys Val Leu Gly Ser Gly Val Phe 
705 710 715 720 

Gly Thr Val His Lys Gly Val Trp He Pro Glu Gly Glu Ser He Lys 
725 730 735 

He Pro Val Cys He Lys Val He Glu Asp Lys Ser Gly Arg Gin Ser 
740 745 750 

Phe Gin Ala Val Thr Asp His Met Leu Ala He Gly Ser Leu Asp His 
755 760 765 

Ala His lie Val Arg Leu Leu Gly Leu Cys Pro Gly Ser Ser Leu Gin 
770 775 780 

Leu Val Thr Gin Tyr Leu Pro Leu Gly Ser Leu Leu Asp His Val Arg 
785 790 795 800 

Gin His Arg Gly Ala Leu Gly Pro Gin Leu Leu Leu Asn Trp Gly Val 
805 810 815 

Gin He Ala Lys Gly Met Tyr Tyr Leu Glu Glu His Gly Met Val His 
820 825 830 

Arg Asn Leu Ala Ala Arg Asn Val Leu Leu Lys Ser Pro Ser Gin Val 
835 840 845 

Gin Val Ala Asp Phe Gly Val Ala Asp Leu Leu Pro Pro Asp Asp Lys 
850 855 860 

Gin Leu Leu Tyr Ser Glu Ala Lys Thr Pro lie Lys Trp Met Ala Leu 
865 870 875 880 

Glu Ser He His Phe Gly Lys Tyr Thr His Gin Ser Asp Val Trp Ser 
885 890 895 

Tyr Gly Val Thr Val Trp Glu Leu Met Thr Phe Gly Ala Glu Pro Tyr 
900 v 905 910 

Ala Gly Leu Arg Leu Ala Glu Val Pro Asp Leu Leu Glu Lys Gly Glu 
915 920 925 

Arg Leu Ala Gin Pro Gin He Cys Thr He Asp Val Tyr Met Val Met 
930 935 940 

Val Lys Cys Trp Met He Asp Glu Asn He Arg Pro Thr Phe Lys Glu 
945 950 955 960 



Leu Ala Asn Glu Phe Thr Arg Met Ala Arg Asp Pro Pro Arg Tyr Leu 
965 970 975 



62 

Val He Lys Arg Glu Ser Gly Pro Gly He Ala Pro Gly Pro Glu Pro 
980 985 990 

His Gly Leu Thr Asn Lys Lys Leu Glu Glu Val Glu Leu Glu Pro Glu 
995 * 1000 1005 

Leu Asp Leu Asp Leu Asp Leu Glu Ala Glu Glu Asp Asn Leu Ala Thr 
1010 1015 1020 

Thr Thr Leu Gly Ser Ala Leu Ser Leu Pro Val Gly Thr Leu Asn Arg 
1025 1030 1035 1040 

Pro Arg Gly Ser Gin Ser Leu Leu Ser Pro Ser Ser Gly Tyr Met Pro 
1045 1050 1055 

Met Asn Gin Gly Asn Leu Gly Glu Ser Cys Gin Glu Ser Ala Val Ser 
1060 1065 1070 

Gly Ser Ser Glu Arg Cys Pro Arg Pro Val Ser Leu His Pro Met Pro 
1075 " * 1080 1085 

Arg Gly Cys Leu Ala Ser Glu Ser Ser Glu Gly His Val Thr Gly Ser 
1090 1095 1100 

Glu Ala Glu Leu Gin Glu Lys Val Ser Met Cys Arg Ser Arg Ser Arg 
1105 1110 1115 1120 

Ser Arg Ser Pro Arg Pro Arg Gly Asp Ser Ala Tyr His Ser Gin Arg 
1125 1130 1135 

His Ser Leu Leu Thr Pro Val Thr Pro Leu Ser Pro Pro Gly Leu Glu 
1140 1145 1150 

Glu Glu Asp Val Asn Gly Tyr Val Met Pro Asp Thr His Leu Lys Gly 
1155 1160 1165 

Thr Pro Ser Ser Arg Glu Gly Thr Leu Ser Ser Val Gly Leu Ser Ser 
1170 1175 1180 

Val Leu Gly Thr Glu Glu Glu Asp Glu Asp Glu Glu Tyr Glu Tyr Met 
1185 " 1190 1195 1200 

Asn Arg Arg Arg Arg His Ser Pro Pro His Pro Pro Arg Pro Ser Ser 
~ " 1205 1210 1215 

Leu Glu Glu Leu Gly Tyr Glu Tyr Met Asp Val Gly Ser Asp Leu Ser 
1220 1225 1230 

Ala Ser Leu Gly Ser Thr Gin Ser Cys Pro Leu His Pro Val Pro lie 
1235 1240 1245 

Met Pro Thr Ala Gly Thr Thr Pro Asp Glu Asp Tyr Glu Tyr Met Asn 
1250 1255 1260 
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Arg Gin Arg Asp Gly Gly Gly Pro Gly Gly Asp Tyr Ala Ala Met Gly 
1265 1270 1275 1280 

Ala Cys Pro Ala Ser Glu Gin Gly Tyr Glu Glu Met Arg Ala Phe Gin 
1285 1290 1295 

Gly Pro Gly His Gin Ala Pro His Val His Tyr Ala Arg Leu Lys Thr 
1300 1305 1310 

Leu Arg Ser Leu Glu Ala Thr Asp Ser Ala Phe Asp Asn Pro Asp Tyr 
1315 1320 1325 

Trp His Ser Arg Leu Phe Pro Lys Ala Asn Ala Gin Arg Thr 
1330 1335 1340 

(2) INFORMATION FOR SEQ ID N0:5: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 5: 

Glu Asp Glu Asp Glu Glu Tyr Glu Tyr Met Asn Arg Arg Arg Arg 
15 10 15 

(2) INFORMATION FOR SEQ ID NO:6: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:6: 

Thr Thr Pro Asp Glu Asp Tyr Glu Tyr Met Asn Arg Gin Arg Asp 
1 5 10 15 

(2) INFORMATION FOR SEQ ID NO:7: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
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. (v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO:7: 

Thr Glu Glu Arg Leu Asp He Lys His Asn Arg Pro Arg Arg Asp 
1 5 10 15 

(2) INFORMATION FOR SEQ ID NO: 8: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 



(xi) SEQUENCE DESCRIPTION: SEQ ID N0:8: 

Arg Ser Arg Ser Arg Ser Arg Ser Pro Arg Pro Arg Gly Asp Ser 
15 10 15 

(2) INFORMATION FOR SEQ ID N0:9: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 15 amino acids 

(B) TYPE: amino acid 
(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 

(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID N0:9: 

Tyr Met Asn Arg Arg Arg Arg His Ser Pro Pro His Pro Pro Arg 
15 10 15 

(2) INFORMATION FOR SEQ ID NO: 10: 

(i) SEQUENCE. CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 10: 
GATCTCGAGT CGAC 
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(2) INFORMATION FOR SEQ ID NO: 11: 

(i) SEQUENCE CHARACTERISTICS: 

(A) LENGTH: 14 base pairs 

(B) TYPE: nucleic acid 

(C) STRANDEDNESS: single 

(D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: DNA (genomic) 
(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 11 
GATCGTCGAC TCGA 

(2) INFORMATION FOR SEQ ID NO: 12: 

(i) SEQUENCE CHARACTERISTICS: 
(A) LENGTH: 5 amino acids 
B) TYPE: amino acid 
D) TOPOLOGY: linear 

(ii) MOLECULE TYPE: peptide 
(v) FRAGMENT TYPE: internal 

(xi) SEQUENCE DESCRIPTION: SEQ ID NO: 12 

Tyr Glu Tyr Met Asn 
1 5 



